PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG044087t1
Common NameTCM_044087
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Trihelix
Protein Properties Length: 565aa    MW: 64658.9 Da    PI: 6.4649
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG044087t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix95.16.8e-30119204187
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm.rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                       rW++qe+l+L+++r++++++++++++k+plW+evs+ m +e+g++rs k+C+ek+enl k+ykk+keg+ +r  ++ +++++f+qlea
  Thecc1EG044087t1 119 RWPRQETLTLLDIRSRLDSKFKEANQKGPLWDEVSRIMaEEHGYQRSGKKCREKFENLYKYYKKTKEGKAGR--QDGKNYRFFRQLEA 204
                       8*************************************999*****************************97..66678*******85 PP

2trihelix66.84.4e-21417502186
          trihelix   1 rWtkqevlaLiearremeerlrr.gklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                       rWt++ev  Li++r++ e+r+++ g +k+ lWee+ +km   g+er +++Ckekw+n+++++  ++e  kkr +e+ ++  yf+ l+
  Thecc1EG044087t1 417 RWTEHEVSSLIQLRKSFESRFQDaGYSKESLWEEIEAKMVGLGYERDAVECKEKWDNMQMYFNMTTECYKKR-KEDFRSSNYFQLLD 502
                       8*********************7478*********************************************8.56666889999876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500906.76112177IPR017877Myb-like domain
SMARTSM007171.2116179IPR001005SANT/Myb domain
PfamPF138378.5E-21118205No hitNo description
CDDcd122031.54E-20118184No hitNo description
PROSITE profilePS500907.538410475IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.606.6E-5411474IPR009057Homeodomain-like
SMARTSM007170.039414477IPR001005SANT/Myb domain
PfamPF138372.0E-16416503No hitNo description
CDDcd122036.37E-21416482No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 565 aa     Download sequence    Send to blast
MEMGDQYGLP DLRQFLARGT HFPDTPQPSE PCFTHTHRNM APLAPYHEAF MVSNGMAVPS  60
SLIRFGHDHF AGASATTTAI AASASSAAAS GPCAALFGVE MESSGIGWSL GNIEGGNSRW  120
PRQETLTLLD IRSRLDSKFK EANQKGPLWD EVSRIMAEEH GYQRSGKKCR EKFENLYKYY  180
KKTKEGKAGR QDGKNYRFFR QLEALYGETS NQSSLLETNL AQRTLLCQTP NNTMNQENQE  240
FLQEQKLSES LTFSNASEFE TSSSENNDDD LSAIAFMMKQ SMVEKQKSIN ESGSSSRVKK  300
GWKTKVKDFV ESQMKKLIDS QDMWMERMLK AIDDKERERV SKEEEWRRQE AARFDKEHEF  360
WAKERSWVEA RDAALLDVLK KFTAGKGLEV SSSAEAPVIT ETHSHNKNQQ DAINTNRWTE  420
HEVSSLIQLR KSFESRFQDA GYSKESLWEE IEAKMVGLGY ERDAVECKEK WDNMQMYFNM  480
TTECYKKRKE DFRSSNYFQL LDSCDGQENN TNTVKQRDSP SNSYVGTHQQ LQDTNSFQIA  540
VHQGDQRLWD RYGLKLGKGK NQQI*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007010380.10.0Transcription factor, putative
TrEMBLA0A061FPI80.0A0A061FPI8_THECC; Transcription factor, putative
STRINGPOPTR_0008s02580.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM27672871
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G03680.11e-112Trihelix family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]